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Abstract. We describe a new methodology for studying persistence of topological fea- 
tures across a family of spaces or point-cloud data sets, called zigzag persistence. Building 
on classical results about quiver representations, zigzag persistence generalises the highly 
successful theory of persistent homology and addresses several situations which are not cov- 
ered by that theory. In this paper we develop theoretical and algorithmic foundations with 
a view towards applications in topological statistics. 



1. Introduction 

1.1. Overview. In this paper, we describe a new methodology for studying persistence of 
topological features across a family of spaces or point-cloud data sets. This theory of zigzag 
persistence generalises the successful and widely used theory of persistence and persistent 
homology [HI |13] . Moreover, zigzag persistence can handle several important situations that 
are not currently addressed by standard persistence. 

The zigzag persistence framework is activated whenever one constructs a zigzag diagram 
of topological spaces or vector spaces: a sequence of spaces Si, . . . , Sn where each adjacent 
pair is connected by a map Si S'j+i or Si ^ Si+i. The novelty of our approach is that 
the direction of each linking map is arbitrary, in contrast to the usual theory of persistence 
where all maps point in the same direction. 

This paper has three principal objectives: 

• To describe several scenarios in applied topology where it is natural to consider zigzag 
diagrams (Section [T|. 

• To develop a mathematical theory of persistence for zigzag diagrams (Sections |2] 
and [3]). 

• To develop algorithms for computing zigzag persistence (Section |4]). 
There is one subsidiary objective: 

• To introduce the Diamond Principle, a calculational tool analogous in power and 
effect to the Mayer-Vietoris theorem in classical algebraic topology (Section [s]). 

This is a theoretical paper rather than an experimental paper, and we devote most of our 
effort to covering the mathematical foundations adequately. The technical basis for zigzag 
persistence comes from the theory of graph representations, also known as quiver theory. We 
are deeply indebted to the practitioners of that theory; what is new here is the emphasis on 
algorithmics and on applications to topology (particularly Sections [l| |4] and [sj. 
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1.2. Persistence. One of the principal challenges when attempting to apply algebraic topol- 
ogy to statistical data is the fact that traditional invariants — such as the Betti numbers or 
the fundamental group — are extremely non-robust when it comes to discontinuous changes 
in the space under consideration. Persistent homology [SI IS] is the single most powerful 
existing tool for addressing this problem. 

A typical workflow runs as follows [6]. The input is a point cloud, that is, a finite subset 
of some Euclidean space or more generally a finite metric space. After an initial filtering 
step (to remove undesirable points or to focus on high-density regions of the data, say), a 
set of vertices is selected from the data, and a simplicial complex S is built on that vertex 
set, according to some prearranged rule. In practice, the simplicial complex depends on a 
coarseness parameter e, and what we have is a nested family {^ejeefcoo]; which typically 
ranges from a discrete set of vertices at 5*0 to a complete simplex at S^o- 

Persistent homology takes the entire nested family {S^} and produces a barcode or persis- 
tence diagram as output. A barcode is a collection of half-open subintervals [bj,dj) C [0, oo), 
which describes the homology of the family as it varies over e. An interval [bj, dj) represents 
a homological feature which is born at time bj and dies at time dj. This construction has 
several excellent properties: 

• There is no need to select a particular value of e. 

• Features can be evaluated by interval length. Long intervals are expected to indicate 
essential features of the data, whereas short intervals are likely to be artefacts of 
noise. 

• There exists a fast algorithm to compute the barcode [13j. 

• The barcode is a complete invariant of the homology of the family of complexes . 

• The barcode is provably stable with respect to changes in the input |4j. In contrast, 
any individual homology group iffc(S'e) is highly unstable. 

The major limitation of persistence is that it depends crucially on the family {5'^} being 
nested, in the sense that C S^' whenever e < e'. This apphes to the current theoretical 
understanding as well as the algorithms. Zigzag persistence addresses this limitation. 

If we discretise the variable e to a finite set of values, the family of simplicial complexes 
can be thought of as a diagram of spaces 

Si — > 5*2 — > ■ ■ ■ — > Sn 

where the arrows represent the inclusion maps. If we apply the /c-dimensional homology 
functor Hk{-¥) with coefficients in a field F, this becomes a diagram of vector spaces 

^1 - ^2 ^ K 

and linear maps, where Vi = Hk{Si] ¥). Such a diagram is called a persistence module. What 
makes persistence work is that there is a simple algebraic classification of persistence modules 
up to isomorphism; each possible barcode corresponds to an isomorphism type. 

Our goal is to achieve a similar classification for diagrams in which the arrows may point 
in either direction. This is zigzag persistence, in a nutshell. 

1.3. Zigzag diagrams in applied topology. We consider some problems which arise quite 
naturally in the computational topology of data. 

Example 1.1. Some of the most interesting properties of a point cloud are contained in 
the estimates of the probability density from which the data are sampled. Deep structure is 
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sometimes revealed after thresholding according to a density estimate (see [3] for an exam- 
ple drawn from visual image analysis). However, the construction of a density estimation 
function p invariably depends on choosing a smoothing parameter: for instance p{x) might 
be defined to be the number of data points within distance r of x; here r is the smoothing 
parameter. 

It happens that different choices of smoothing parameter may well reveal different struc- 
tures in the data; a particularly striking example of this occurs in |3]. Statisticians have 
invented useful criteria for determining what the 'appropriate' value of such a parameter 
might be for a particular data set; but another point of view would be to analyse all values 
of the parameter simultaneously, and to study how the topology changes as the parameter 
varies. 

The problem with doing this is that there is no natural relationship between, say, the 25% 
densest points as measured using two different parameter values. This means that one cannot 
build an increasing family of spaces using the change in parameters, and so one cannot use 
persistence to analyze the evolution of the topology. On the other hand, there are natural 
zigzag sequences which can be used to study this problem. Select a sequence of parameter 
values ri < r2 < ■ ■ ■ < r„ and a percentage p, and let X'^ denote the densest p% of the 
point cloud when measured according to parameter value r. We can then consider the union 
sequence 

xp ^ U ^ X'P ^ ux^ ^ xp ^ ■■■ ^ xp 

ri ri ^ r2 r2 r2 r-^ rg »*„ 

or the intersection sequence 

xp ^ xp nxp ^ xp ^ xp nxp xp < > xp . 

ri ri ' ' r2 r2 r2 rg rg r„ 



As we see in Section 5.3, there is essentially no difference between the zigzag persistent 
homology of the union and intersection sequences of a sequence of spaces. Here that assertion 
needs to be filtered through the process of representing the data subsets Xp as simplicial 
complexes. 

Example 1.2 (Topological bootstrapping). Suppose we are given a very large point cloud X. 
If it is too large to process directly, we may take a sequence of small samples Xi, . . . , X„ and 
estimate their topology individually, perhaps obtaining a persistence barcode for each one. 
How does this reflect the topology of the original sample X? On one hand, if most of the 
barcodes have similar appearance, then one might suppose that X itself will have the same 
barcode. On the other hand, one needs to be able to distinguish between a single feature 
detected repeatedly, and multiple features detected randomly but one at a time. If we detect 
n features in Xi on average, are we detecting n features of X with detection probability 1, 
or kn features with detection probability l/fc? 

Once again, there is a need to correlate features across different instances of the construc- 
tion. The union sequence comes to the rescue: 

Xi ^ Xi U X2 ^ X2 x„ 

In this case, the intersection sequence is not useful at the level of samples, because two sparse 
samples are unlikely to intersect very much. 

The approach in this example is analogous to bootstrapping in statistics, where mea- 
surements on a large data set are estimated by making repeated measurements on a set of 
samples. 
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Example 1.3. In computational topology, there exist several techniques for modelling a 
point cloud data set X by a simplicial complex S: the Cech complex, the Vietoris-Rips 
complex, the alpha complex [9j, the witness complex [B], and so on. The witness complex 
W{X; L), in particular, depends on the choice of a small subset of 'landmark' points L (Z X 
which will serve as the vertex set of S. Roughly speaking, a simplex a with vertices in L 
is included in W{X; L) if there is some x G X which witnesses it, by being close to all the 
vertices. 

How does the witness complex W{X] L) depend on the choice of landmark set? There 
is no direct way to compare W{X] L) with W{X] M) for two different choices of landmark 
sets L, M. However, it turns out that one can define a witness bicomplex W{X] L, M) which 
maps onto each witness complex. The cells are cartesian products cr x r, where a, r have 
vertices in L, M respectively. A cell a x r is included provided that there exists x & X which 
simultaneously witnesses a for W{X; L) and r for W{X] M). 

Given a sequence Li, . . . ,Ln of landmark subsets, one can then construct the biwitness 
zigzag: 

W{X; Li) ^ W{X; L,, L^) ^ W{X- L2) < > W{X; L^) 

Long intervals in the zigzag barcode will then indicate features that persist across the cor- 
responding range of choices of landmark set. 

The fundamental requirement is then for a way of assessing, in a zigzag diagram of vector 
spaces, the degree to which consistent families of elements exist. The point of this paper 
is that there is such methodology. We will interpret the isomorphism classes of zig-zag 
diagrams as a special case of the classification problem for quiver representations (see [7j for 
background on this theory). There turns out to be a theorem of Gabriel pU] which classifies 
arbitrary diagrams based on Dynkin diagrams, and which shows in particular that the set 
of isomorphism classes of zigzag diagrams of a given length is parametrised by barcodes — 
just as persistence modules are. Long intervals in the classification define large families of 
consistent elements, hence indicate the presence of features stable across samples, landmark 
sets, or parameter values for a density estimator. 

1.4. Organisation of the paper. In Section [2] we describe the theory of decompositions 
of zigzag modules. These decompositions produce zigzag persistence barcodes analogous to 
the barcodes of persistent homology. The foundational theorem of Gabriel is stated without 
proof. In Section [3] we develop the machinery of right-filtrations, which turn out to be the 
right tool for accessing the decomposition structure of a zigzag module. This is an important 
section for the reader who wishes to make serious use of zigzag persistence. In Section |4| we 
present a general-purpose algorithmic framework for calculating zigzag persistence, and we 
show how this operates in a practical class of examples. The algorithm is based on a proof of 
Gabriel's theorem for zigzag modules, included for completeness. Section [5] is devoted to a 
localisation principle which gives another approach to zigzag barcode calculations. We apply 
this to prove the Diamond Principle. We use this in turn to compare the zigzag barcodes 
for two natural zigzag diagrams obtained from a sequence of simplicial complexes. 

2. Zigzag Diagrams of Vector Spaces 

We work over a field F which remains fixed throughout this paper. There is no significance 
to the choice of F. All vector spaces are finite-dimensional. 
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2.1. Zigzag modules. Let V denote a sequence of vector spaces and linear maps, of length n: 



Each represents either a forward map — ^ or a backward map The object V is 

called a zigzag diagram of vector spaces, or simply a zigzag module, over F. 

The sequence of symbols f or g is the type of V. For instance, a diagram of type r = fgg 
looks like this: 

The length of a type r is the length of any diagram of type r. For example, we say that fgg 
has length 4. We will usually be considering zigzag modules of a fixed type r of length n. 
Such diagrams are called r-modules, and the class of r-modules is denoted rMod. 

Persistence modules (see [8|,|13j) are zigzag modules where all the maps have the forward 
orientation; in other words, where r = //... /. As explained in [13], persistence modules 
can be viewed as graded modules over the polynomial ring F[t]. This observation simplifies 
the analysis of persistence modules quite considerably. 

More generally, one can consider graph representations of arbitrary oriented graphs. 
Zigzag modules constitute the special case where the graph is An (a path with n vertices and 
n — 1 edges) and the orientation is specified by the type r. In 1972, Gabriel showed that the 
Dynkin-Coxeter graphs An, Dn, Eg, Ej, Eg (arbitrarily oriented) have an especially well- 
behaved representation theory [1^. The theory of quivers was launched from this starting 
block; see [7] for a beautiful and transparent introduction. Zigzag persistence is enabled by 
the good behaviour of An graph representations. 

Remark. rMod has the structure of an abelian category. Given two r-modules V, W, a 
morphism a : V — > W is defined to be a collection of linear maps ai : Vi ^ Wi which satisfy 
the commutation relations ai+ifi = hiai or aigi = fcjaj+i for each i. (Here the forward 
and backward maps for W are written h,k respectively.) Morphisms can be composed in 
the obvious way, and have kernels, images, and cokernels: for instance IK = Ker(a) is the 
r-module with spaces Ki = Ker(Vi Wi) and maps and gilxi+i defined by restriction. 
The set of morphisms Hom(V, W) is naturally a vector space over F, and the endomorphism 
ring End(V) = Hom(V, V) is a non-commutative F-algebra. We can view End(V) as the 
subalgebra of End(Vi) x ■ ■ ■ x End(V^) defined by the commutation relations. 

2.2. Decompositions of zigzag modules. We wish to understand zigzag modules by 
decomposing them into simpler parts. Accordingly, a submodule W of a r-module V is 
defined by subspaces Wi < Vi such that fiiWi) C Wi+i or gi(Wi+i) C Wi for all i. These 
conditions guarantee that W is itself a r-module, with maps given by the restrictions fi\wi 
or gi\wi+i- We write W < V. 

A submodule W is called a summand of V if there exists a submodule X < V which is 
complementary to W, in the sense that Vi = Wi (B Xi for all i. In that case, we say that V 
is the direct sum of W, X and write V = W © X. 

Example 2.1. As a rule, most submodules are not summands. V = (F F) has the 
submodule W = (0 — > F). However, W is not a summand because the only possible 
complement is (F — > 0), and that is not a submodule of V. 
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Remark. The direct sum can also be defined as an 'external' operation: given r-modules 
V, W their direct sum V © W is defined to be the r-module with spaces Vi (BWi and maps 
fi © hi or QiQki. (Here the forward and backward maps for W are written h, k respectively.) 

A r-module V is decomposable if it can be written as a direct sum of nonzero submodules, 
and indecomposable otherwise. Any r-module V has a Remak decomposition; in other 
words we can write V = Wi © ■ ■ ■ © Wat, where the summands Wj are indecomposable. The 
existence of such a decomposition is proved by induction on the total dimension dim(Vi): 
if V is decomposable, say V = W © X, then we may assume inductively that W, X have 
Remak decompositions, and therefore so does V. (Base case: if V is indecomposable, then 
it has a Remak decomposition with one term.) 

Remak decompositions themselves are not unique. However, the KruU-Schmidt principle 
from commutative algebra tells us that the summands in a Remak decomposition are unique 
up to reordering: 

Proposition 2.2. (Krull-Remak-Schmidt.) Suppose a r-module V has Remak decomposi- 
tions 

V = Wi©---©Wa,/ and V = Xi © ■ ■ ■ © Xat. 
Then M = N and there is some permutation a o/ {1, . . . , A^} such that Wj = Xo-(j) for all j . 

Proof. The proof of Theorem 7.5 of Lang [12j, which is stated for modules in the ordinary 
sense, can be applied verbatim to our present context; all the required algebraic operations 
can be carried out within End(V). Since our r-modules have finite total dimension, the 
ascending and descending chain conditions (acc and Dcc) are automatic. □ 

For further context, we refer the reader to an elegant article by Atiyah [T]; the KruU- 
Schmidt principle applies in any exact abelian category to objects which satisfy ACC and 
DCC, or a weaker 'bi-chain condition' defined by Atiyah. Our category, rMod, is included 
by this formulation. 

Thus we can use the multiset {Wj} as an isomorphism invariant of V. For this to be 
useful, we need to identify the set of indecomposable r-modules. We now describe a natural 
collection of indecomposables. For each subinterval [b, d] of the integer sequence {1, . . . ,n} 
there is an associated r-module. 

Definition 2.3. Let r be a type of length n and consider integers 1 < b < d < n. The 
interval r-module with birth time b and death time d is written 1^(6, d) and defined with 
spaces 



and with identity maps between adjacent copies of F, and zero maps otherwise. When r is 
implicit, we will usually suppress it and simply write 1(6, d). 

Example. If r = fgg then 1(2, 3) is the zigzag module 




F 



if 6 < 2 < 
otherwise; 











F 



1 



F 







0. 



Proposition 2.4. Interval r-modules are indecomposable. 
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Proof. Suppose 1(6, d) = V© W and consider two adjacent terms F connected by an identity 
map. Since V, W are submodules, the dimensions of V and W cannot decrease in the direction 
of the map; nor, since they are complements, can they increase. Thus dim(Vi) and dim(PVj) 
are constant over b < i < d, and in particular one of V, W must be zero. □ 

Here is the foundation stone for the theory of zigzag persistence. 

Theorem 2.5 (Gabriel). The indecomposable r-modules are precisely the intervals l{b,d), 
where l<b<d<n = length(r). Equivalently, every r-module can be written as a direct 
sum of intervals. 

Proof. This is the simplest special case of Gabriel's theorem, for the graphs An- The original 
reference (in German) is |10]. See |7] for an accessible overview. □ 

Thus, any r-module can be described completely up to isomorphism as an unordered 
list of intervals [6, d\ , which correspond to its indecomposable summands. This is in exact 
accordance with the special case of ordinary persistence, where the result is comparatively 
easy to prove: it is simply the classification of finitely-generated graded modules over the 
polynomial ring F[t] (see [T3]). 

The philosophical point is that the decomposition theory of graph representations is some- 
what independent of the orientation of the graph edges (see Kac [H]). Even in our case this 
is surprising, because there is no obvious congruence between persistence modules and zigzag 
modules of an arbitrary type r. However, if we accept this principle, then the generalisation 
from ordinary persistence to zigzag persistence is not surprising: interval decomposition for 
persistence modules implies interval decomposition for zigzag modules. 



We will devote much of this paper to constructing a stand-alone proof of Theorem 2^ This 
provides technical support towards our two main goals: to provide algorithms for computing 
the interval summands of a given r-module; and to make rigorous statements about the 
output of those algorithms. 

2.3. Zigzag persistence. We now define zigzag persistence and develop some of its ele- 
mentary properties. 

Definition 2.6. Let V be a zigzag module (of arbitrary type). The zigzag persistence 

of V is defined to be the multiset 

Pers(V) = rf,] C {1, . . . , n} | j = 1, . . . , N] 

of integer intervals derived from a decomposition V = I(6i, di)® ■ ■ ■ ® II(&Ar, d^)- The KruU- 
Schmidt principle asserts that this definition is independent of the decomposition. 

Graphically, Pers(V) can be represented as a set of lines measured against a single axis 
with labels {1, . . . , n} (the barcode), or as a multiset of points in lying on or above the 
diagonal in the positive quadrant (the persistence diagram). See Figure [l] for an example 
presented in each style. 

Remark. In the special case of persistence modules, this agrees with the standard treatment 
(see |H1[I3]) except in the following particular: the closed integer intervals \bj,dj\ C {1, . . . , n} 
are replaced by half-open real intervals ^j-,dj + 1) C M in the standard treatment. This is 
particularly natural when the indexing parameter is continuous: an interval [6, d) indicates 
a feature born at time b that survives right up to, but vanishes at, time d. Our convention is 
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Figure 1. Barcode (left) and persistence diagram (right) representations of 
the persistence {[1,2], [1,3], [3,3], [3,4], [3,4]} of a zigzag module of length 4. 

motivated by the desire to maintain symmetry between the forward and backward directions. 
We advise the reader to take particular care in handling the different conventions. 

The transition from a zigzag module to its interval decomposition presents certain hazards 
which are not present in the case of persistence modules. We now draw attention to these 
hazards. 

Definition 2.7. Let V be a zigzag module and let V[p, q] denote the restriction of V to the 
index set p < i < q. A feature of V over the time interval [p, q\ is a summand of V[p, g] 
isomorphic to I(p, q). 

With persistence modules, there are several equivalent ways to recognise the existence of 
a feature. Here is a sample result. 

Proposition 2.8. Let Y be a persistence module of length n, and let 1 < p < q < n. The 
following are equivalent: 

(1) The composite map Vp ~^Vq is nonzero. 

(2) There exist nonzero elements Xi G Vi for p < i < q, such that Xj+i = fi{x,i) for 
p < i < q. 

(3) There exists a submodule ofY[p,q] isomorphic to I{p,q). 

(4) There exists a summand ofY[p,q] isomorphic to I{p,q), i.e. a feature over [p,q]. 

Proof. It is easy to verify that (1), (2), (3) are equivalent. For (1) =^ (2), begin by choosing 
Xp & Vp that maps to a nonzero element in Vq, and let Xi be the image of Xp in Vi. For 
(2) =^ (3), define I by Jj = Span(xj). For (3) =^ (1), note that the restriction Ip — > Ig is 
nonzero. 

Clearly (4) =^ (3). We now show that (1) =^ (4). Consider an interval decomposition 
^[^7 q] = 1(^1) di)Q)- ■ • ©I(6Ar, dN)- On each summand, the map Ip{bj, dj) Iqipj, dj) is zero 
unless bj = p and dj = q. Thus at least one of the summands is isomorphic to I{p, q). □ 



The intuitions supported by Proposition 2.8 break down in the general case 



Caution 2.9. Let V be a zigzag module of arbitrary type. Statement (1) has no clear 
interpretation at this stage (something can be said in terms of the right-filtration functor of 
Section |3]). Consider the following statements: 

(2) There exist nonzero elements G for p < i < g, such that Xj+i = /j(xj) or 

) (whichever is applicable) ioi p < i < q. 

(3) There exists a submodule of V[p, q\ isomorphic to I(p, q). 

(4) There exists a summand of V[p, g] isomorphic to I(p, g), i.e. a feature over [p, g]. 



It is easy to verify that (2) (3) and that (4) imphes (2), (3). However, the next two 
examples demonstrate that (2), (3) do not in general imply (4). 



Example 2.10. Let t — gf and consider the r-module V defined as follows: 

F ^ — f2 > F 
X < 1 {x,y) I > y 

The interval decomposition is V = 1(1, 2) ® 1(2, 3), where the summands are 

F < F©0 > 

X i 1 (x,0) 

and 

< 0®F > F 

(0,y) I > y 

respectively. If this example appeared in a statistical topology setting, the feature corre- 
sponding to the generator of the F at Vi would be regarded as unrelated to the feature 
corresponding to the generator of the F at V3. 

On the other hand, V does have a submodule (in fact, many submodules) isomorphic to 
1(1, 3). Indeed, let A = {(x, x) | a: G F} denote the diagonal subspace of F^. Then 

F i A > F 

X i 1 {x,x) I > X 

is a submodule W < V isomorphic to 1(1,3). The quotient r-module V/W is isomorphic 
to 1(2, 2) but W has no complementary r-module in V. Indeed, that would contradict the 
Krull-Schmidt theorem. More concretely, any complement of W must be isomorphic to 
(0 < — F — > 0), but that would require a 1-dimensional subspace of Ker(g'i) fl Ker(/2) = 0. 

Example 2.11. We can extend the previous example to arbitrary length. Consider the type 
T — gf . . . gf — {gf)"', of length 2n + 1. Let V be the r-module 

F^F^^F^^^i ^F^F^^F, 

where 7ri{x, y) — x, and 7r2{x, y) — y. Then V is isomorphic to a sum of short intervals 

1(1, 2) ® {1(2, 4) ® • • • ® I{2n - 2, 2n)} ® I{2n, 2n + 1) 

but it has a submodule 

¥ < — A — >¥ < >¥ < — A — ^F 

isomorphic to the long interval 1(1, 2n -|- 1). 

Moral. In zigzag persistence it is necessary to respect the distinction between submodules 
and summands. Features are defined in terms of summands; never submodules. 

We have defined features in terms of a chosen subinterval \p,q]. Features behave as ex- 
pected when zooming to a larger or smaller window of observation. The following proposition 
illustrates what we mean. 

Proposition 2.12. Let Y be a zigzag module of length n and let 1 < p < q < n. The 

following statements are equivalent. 
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(1) There exists a summand o/V[p, g] isomorphic to l{p,q), i.e. a feature over [p,q\. 

(2) There exists a summand ofY isomorphic to I{p',q'), for some [p',q'] ^ [p,(l]- 

Indeed, there is a bijection between intervals [p, q] in Pers(V[p, q]) and intervals [p', q'] ^ [p, q] 
in Pers(V). 

Proof. Consider an interval decomposition V = I{bi,di) © ■ ■ ■ © libNyd^)- By restriction, 
this induces an interval decomposition of V[p, q] into intervals 1(6^, dj)\p, q]. This induces the 
claimed bijection, because [bj,dj] restricts to [p,q] if and only if [bj,dj] ^ [p,q]. □ 

Operating invisibly in this proof is the KruU-Schmidt principle, which allows us to select 
the interval decompositions most convenient to us when calculating Pers(V) and Pers(V[p, q]). 

Remark. Sometimes it is useful to reduce the resolution of Pers(\^). Let K G {1, . . . ,n} he 
any subset. We define the restriction of Pers(V) to K to be the multiset 

Pers(V)|x = {lnK\I e Pers(V), / n ^ 0} . 



For instance. Proposition 2.12 amounts to the observation that Pers(V[p, g]) = Pers(V)|[p,g]. 



3. From Zigzag Modules to Filtrations 

3.1. The right-filtration operator. Our strategy is to understand (and construct) decom- 
positions of a r-module V by an iterative process, moving from left to right and retaining the 
necessary information at each stage. The bulk of this information is encoded as a filtration 
on the rightmost vector space Vn- 

Definition 3.1. The right-filtration R(V) of a r-module V of length n takes the form 

R(V) = iRo,Ri,...,Rn), 
where the Ri are subspaces of Vn satisfying the inclusion relations 

= Ro<Rl<-- - <Rn = Vn. 

R(V) is defined recursively as follows. 
Base case: 

• If V has length 1, then R(V) = (0, Vi). 
Recursive step. Suppose we have already defined R(V) = {Rq, Ri, . . . , Rn). 

. If V+ is V ^ K+i, then R(V+) = (/„(i?o), /n(i?l), • • • , fn{Rn), K+i). 

. If V+ is V ^ K+1, then R(V+) = {0, g-'{Ro), g~'iRi), . . .,g-\Rn)). 
To verify that R(V"'") in the two inductive cases is a filtration of the specified form, note that 
Ri < Ri+i implies that fn{Ri) < /n('"i+i) in the first case, and g~^{Ri) < in the 

second case. Moreover fn{Ro) = fn{0) = 0, and g~^{Rn) = 5'n^(K) = K+i- 

Example 3.2. Here are the right-filtrations for the two length-2 types: 

R{V^^V2) = iO,fi{V),V2) 
RiV.J^V^) = {0,g^\0),V2) 

10 




12^ 

Figure 2. Forward propagation of the right-filtration, illustrated for the four 
types //, fg, gf, gg of length 3. 

Example 3.3. Here are the right-filtrations for the four length-3 types: 



R(Vi 


/i. 


V2 


/2, 


Vs) 


= {0,f2fliV,)j2iV2),Vs) 




/i. 


V2 


92 




= {0,g2\0),g^'h{V,),V3 


R(V^i 


91 


V2 




V3) 


= {0j2gl\0)j2iV2),Vs) 




91 


V2 


92 




= iO,g2\0),g,'g^\0),Vs 



See Figure [2] for a schematic representation. 

Remark. In the examples above, it is not difficult to see that R(V) comprises all the subspaces 
of Vn that are naturally definable in terms of the maps pi. 

Each of the n subquotients Ri/Ri^i carries information dating back to some earliest Vj in 
the sequence of vector spaces. 

Example 3.4. The module Vi — ^ V2 has right-filtration (0, /i(Vi), V2). The first subquo- 
tient /i(Vi)/0 = fi{Vi) corresponds to vectors born at time 1 which survive to time 2. The 
second subquotient V2/ fiiVi) corresponds to vectors which appear only at time 2. 

Example 3.5. The module Vi V2 has right-filtration (0,(7^^(0),^). The first subquo- 
tient gi^{0) corresponds to vectors at time 2 which are destroyed when mapping back to 
time 1. The second subquotient V2/gi^{0) is isomorphic to 5'i(V2) and records those vectors 
which survive from time 2 back to time 1. 

Definition 3.6. The birth-time index b(r) = (61, 62, • • • , &n) is a vector of integers bi which 
indicate the birth-times associated with the subquotients Ri/Ri-i of the right-filtration of a 
r-module. This is defined recursively as follows. 
Base case: 

• If r is empty (i.e. V has length 1) then b(r) = (1). 

Recursive step. Suppose we have already defined b(r) = (61, 62, • • • , bn)'- 

• If r+ is rf then b(r+) = (61, . . . , 6„, n + 1). 

• If r+ is rg then b(r+) = (n + 1, 61, . . . , 
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Example 3.7. At length 2 we have b(/) = (1,2) whereas h{g) = (2, 1). This is consonant 



with the discussion in Examples 3^ and |3.5[ 

Example 3.8. Here are the birth-time indices for the types of length 3. 

b(//) = (1, 2, 3), Hfg) = (3, 1, 2), h{gf) = (2, 1, 3), Hgg) = (3, 2, 1). 

In summary, the information in a r-module V which survives to time n is encoded as a 
filtration R{Y) on V"„. The 'age' of the information at each level of the filtration (i.e. at each 
subquotient) is recorded in the birth-time index b(r). 

For a simplified but precise version of this last claim, we now calculate the right-filtrations 
of interval r-modules. In the filtration specified in the following lemma, Ji/Ji-i = F is the 
only non-zero subquotient, corresponding to the birth time 6,. 

Lemma 3.9. Let t he a type of length n, with b(r) = (6i, 62, • • • , ^n)- For each i = 1,2, . . . ,n, 
we have an isomorphism 

R{Ir{bi,n)) =d{i,n), 
where n) = ( Jq, Ji, ■ ■ ■ , Jn) is the filtration on F defined by 

Jo = ... = J,_, = 0- J, = . . . = J„ = F. 

Remark. We refer to the 3{b, n) also as intervals (but now in the category of filtered vector 
spaces). 

Proof. This is a straightforward calculation by induction on r. For the base case, r is empty 
and b(r) = (1). Then R(I(1, 1)) = (0,F) = ^J(l, 1) as claimed. Now suppose the result 
is known for r, with b(r) = (61,..., 6„). Suppose r"*" = rf or rg. In both cases, write 
b(r+) = (6+,...,6+^,). 

Case /: Suppose that 1 <i <n; then b'l = bi and therefore 

I,+ (6+,n + l) = (Ir{bi,n) ^¥). 
Writing Ril^-^bi, n)) = n) = ( Jq, Ji, . . . , J„), it follows that 

R(V(6+,n + l)) = {Jo,Ji,...,Jn,¥) = 3{i,n + l). 
For i = n + 1, we have b^_^_i = n + 1, and indeed 

R(I,+ (n + l,n + l)) =R((...) ^ F) = (0, . . . , 0, F) = a(n + 1, n + 1). 
Case g: Suppose that 2 < i < n + 1; then b^ = and therefore 

V(6+,n + l) = (I,(6,_i,n)^F). 
Writing R(Ir{bi-i,n)) = 3{i — = (Jq, Ji, ■ ■ ■ , Jn), it follows that 

R(V(6+, n + l)) = (0, Jo,Ji,..., Jn) = 3{i, n + 1). 
For i = 1, we have b\ = n ^ \ and then 

R(I,+ (n + l,n + l)) = R((...) ^F) = (0, F, . . . , F) = a(l, n + 1) 

as required. □ 

Thus the right-filtration (with the help of the birth-time index) distinguishes the different 
intervals 1(6, ra). It gives no information about intervals 1(6, rf) when d < n, since in those 
cases /„ = 0. 
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Example 3.10. Consider r = fgf, so b(r) = (61, 62, &3, ^4) = (3, 1,2,4) and in general 



In particular, 



2,4) 


= R(F 


^ F 


^ F 


^ F) 


= (0, 0, F, F, F) 


= a(2,4) 


3,4) 


= R(0 


— > ¥ 


^ F 


^ F) 


= (0, 0, 0, F, F) 


= a(3,4) 


1,4) 


= R(0 


— > 


< — F 


^ F) 


= (0, F, F, F, F) 


= a(i,4) 


4,4) 


= R(0 


— > 


< — 


— > F) 


= (0, 0, 0, 0, F) 


= a(4,4) 



which is in accordance with Lemma 13.91 

3.2. Decompositions of filtered vector spaces. We now consider filtered vector spaces in 
their own right, independently of the connection to zigzag- modules, and develop the theory 
of Remak decompositions. We will see later that this is the right tool for understanding 
Remak decompositions of zigzag modules. 

A filtered vector space of depth n is a sequence Ji = (i?o,-Ri, • • • ,Rn) of vector spaces, 
where = i?o < -Ri < ■ ■ ■ < Rn- The class of such objects is denoted Filt„. The right- 
filtration R(V) of any zigzag module V of length n belongs to this class, as do the intervals 



2{i, n) defined in Lemma 3.9 Indeed, if G Filt„ satisfies dim(i?„) = 1, then "31 is isomorphic 
to some n). 

Remark. Filt„ can be given the structure of a category in a natural way, but it is not quite 
an abelian category since morphisms do not generally have cokernels. 

A filtered vector space § = (5*0, 6*1, . . . , S^) is a subspace of 51 ii Si < Ri for all i. 
It is appropriate to consider a stronger notion of subspace when dealing with direct-sum 
decompositions: S is an induced subspace of 51 if there exists a vector subspace K < i?„ 
such that Si = RiCi K for all i. In that event, we write § = OlCiK. Note that K = RnCiK = 

We say that 01 is the direct sum of two subspaces, and write 01 = S (B 7 , if Ri = Si (B Ti 
for all i. We claim that §, 7 must be induced subspaces. Note that fl T„ = 0. For each i, 
then, Ri fl S'„ is a subspace of Ri which contains Si and meets Tj < T„ only at 0. It follows 
that RiH Sn = Si for all i. Thus § = Oin Sn, and symmetrically T = fl T„. 

The general form of a direct-sum decomposition is therefore 01 = (01(1 K) © (01(1 L). What 
are the requirements on K, L to make this a valid decomposition? The direct sum condition 
implies that Rn = K (B L as a vector space. Moreover, given a vector space decomposition 
Rn = K (B L, the further condition 



Ri = Span nK,Rin L) for all i 

is necessary and sufficient to guarantee 01= (Oi (^ K) (B (01 (^ L) . 

If = S © T, the two subspaces S, 7 are said to be complementary summands. The 
following fact radically simplifies the decomposition theory of filtered vector spaces. 

Proposition 3.11. Every induced subspace of a filtered vector space has a complementary 
summand. 
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Proof. We are given § = 'Jin K, and seek to construct T = (Tq, Ti, . . . , T„) such that 
= S © T. We proceed inductively. Since i?o = •S'o = we take Tq = 0. Now suppose we 
have chosen so that Rk = (B T^- In particular, Tj. (1 = 0. Then 

Tk n Sk+i < Tk nSn = (Tfc n Rk) n 5„ = n {Rk n 5„) = n 5^ = o. 

Thus Tk and S^+i are independent subspaces of Rk+i, and so Tk can be extended to a 
complement T^+i of S'^+i in -Rfc+i. This completes the induction. □ 

Corollary 3.12. The indecomposables in Filt„, are precisely the intervals Siiyu). Thus, 
every filtered vector space can be decomposed as a finite direct sum of intervals. 



Proof. By Proposition 3.11 , 3? has nontrivial summands if and only if i?„ has nontrivial 
vector subspaces; this happens exactly when dim(i?„) > 1. □ 

The dimension of 3^ G Filt„ is defined to be the vector of integers 

dim(3^) = (ci,C2, . . . ,c„), 
where Cj = dim(i?j/i?j_i) are the dimensions of the successive subquotients of the filtration. 

Proposition 3.13. Let 01 be a filtered vector space of depth n, with dim(3l) = (ci, C2, . . . , c^). 
For any decomposition of Jl into intervals, the multiplicity of3ii,n) is Ci. Thus: 

l<i<n 

Proof. Let rrii be the multiplicity of ^(i,n). Then, for all k, 

dim{Rk) =1711 + 1712-1 + mk 

by considering the contribution of each summand, whereas 

dim{Rk) = ci + C2 H h 

by considering the contribution of each subquotient Ri/Ri-i. This is possible only if rrii = Ci 
for alH. □ 

This concludes our tour of the decomposition theory for filtered vector spaces. Now we 
must leverage this to achieve a decomposition theory for r-modules. In one direction, the 
relationship is straightforward: 

Proposition 3.14. The right-filtration operation respects direct sums, in the sense that 

R(Vi © . . . © Vjv) = R(Vi) © ■ ■ ■ © R(V^) 
for T-modules Vi, . . . , Vat. 



Proof. This is proved by induction on r, following the recursive structure of Definition 3.1 
and using the standard facts 

(/l © ■ ■ ■ © /7v)(i?l © ■ ■ ■ © Rn) = fliRl) © ■ ■ ■ © fNiRN) 

and 

(^1 © ■ • • © gNy\Ri ®---®Rn) = 9i\Ri) © ■ ■ ■ © g^'iRN) 

from linear algebra. (For simplicity we are suppressing various indices here.) □ 
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However, what we need is a converse to Proposition 3.14 if the filtered vector space 
= R(V) can be spht as a direct sum 'Ji = CRi © • ■ • © CR^v, we would like to infer a 
corresponding splitting V = Vi © ■ • • © V^v of r-modules. In the following two sections we 
establish such a principle for a particular class: the 'streamlined' r-modules. 

3.3. Streamlined modules. We introduce a special class of r-module for which the right- 
filtration functor preserves all structural information. 



Definition 3.15. A r-module V is (right-) streamlined if each 
is surjective. 



is injective and each 



fi 



Similarly, we may say that a r-module V is left-streamlined if each — ^ is surjective anc. 
each is injective. We will not need to consider left-streamlined modules until Section [s 
By default, 'streamlined' will be taken to mean 'right-streamlined'. 

Example 3.16. Intervals 1(6, n) are streamlined (but not 1(6, (i) for d < n). Conversely, a 
streamlined r-module V with dim(V^) = 1 is necessarily isomorphic to some I{b,n). Indeed, 
dim(V^) is a non-decreasing sequence and therefore comprises some 6 — 1 zeros (where 1 < 
h < n) followed by n — 6 + 1 ones. The maps between the one-dimensional terms are injective 
or surjective, and therefore isomorphisms. 



Proposition 3.17. A direct sumY 
if each summand is streamlined. 



Vi © • • • ©Vat of t -modules is streamlined if and only 



f 



Proof. Each — ^ in V decomposes as / = /i ( 
is injective. Each in V decomposes as g 



■ ■ ■ ® fN and is injective if and only if each fj 
gi® ■ ■ ■ ® Qn and is surjective if and only if 



each Qj is surjective. 

The proof of the following lemma appears at the end of this section. 



□ 



Lemma 3.18 (Decomposition Lemma). Let Y be a streamlined r-module and let "Ji = R(V). 
For any decomposition CR = Si © ■ ■ ■ © Sat, there exists a unique decomposition Y = Wi © 
• ■ ■ © Wtv such that Si = R{Wj) for all j. 

Theorem 3.19 (Interval decomposition for streamlined modules). Let Y be a streamlined r- 
module of length n, and write dim(R(V)) = (ci, C2, . . . , c„) and b(r) = (6i, 62, ... , 6„). Then 
there is an isomorphism of r-modules 



Ql(6„n). 



l<j<n 



Proof. Let CR = R(V). By Proposition 3.13, there is a decomposition CR = ^i©- ■ - ©^tv, where 
the tJj are a collection oi N = ci-\- ■ ■ ■ + Cn intervals with ^(i, n) occuring with multiplicity q. 
Lemma 3.18 produces a decomposition V = Ii © ■ ■ ■ © Iat, with R(Ij) = 3j for all j. Each Ij 

and is therefore 
^ ^ ^ 1(6, 

a(^,n). 



is streamlined (Proposition 3.17) with maximum dimension dim((/j)„) = 1 
isomorphic to some 1(6, n) (Example 3.16). By Lemma 3.9, we must have I 
whenever 



It follows that the I, collection of 



Cl + 



with 



occuring with multiplicity q. 



+ c„ intervals 

□ 



We complete this chapter with a proof of the Decomposition Lemma. 
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Proof of Lemma \3 . 1 8 ^ We may assume that N = 2, since the general case follows by itera- 
tion. Accordingly, suppose that 31 = R(V) can be written in the form 01 = S (B 7; we must 
show that there is a corresponding decomposition V = W © X. We will argue by induction 
on n = length(r). 

The first step is to determine the splitting Vn = iy„©X„. In fact, the stipulation that S = 
R(W) and 7 = R(X) forces Wn = Sn and X„ = T„. If n = 1, then we are done. Otherwise, 
let V denote the truncation of V to the indices {1, . . . , n — 1} and let "Ji = R(V). We will 
shortly establish that = S © T induces a unique compatible decomposition 3? = S © T. The 
inductive hypothesis will then provide V = W © X, which combines with Vn = Wn © X„ to 
produce the desired decomposition V = W © X. That will complete the proof. 

Write = {Rq, . . . , Rn). There are two cases. 

Case injective. We can identify Vn-i with the subspace fn-i{Vn-i) = Rn-i of Vn- 
Thereupon we have 

3^ = (-Ro) Ri, ■ ■ ■ , Rn-l)- 
The unique splitting of compatible with Vn = Wn © X„ is 

Vn^l = (Rn-l n Wn) © {Rn-l H X„) = Sn-1 © T„_i. 

We must now verify that the induced subspaces S = 31 fl Sn~i and T = 3? fl T„_i give a valid 
decomposition 3^ = S © T of filtered vector spaces. This follows because Si = Rid Sn-i = 
RiH Sn = Si and similarly Ti = Ti, for all i < n; so Ri = Si Q) Ti = Si Q) % as required. 

Case ^— ^, surjective. Here we identify Vn-i as the quotient Vn/ ker((7„_i) = Rn/Ri- Under 
this identification, 

^ = {Ri/Ri^ R2/R1, • • • , Rn/Ri)- 
In splitting Vn-i = Wn-i © Xn-i we are compelled to take 

Wn-l = gn~liWn) = Sn/ Si, X„_i = fi'„_l(X„) = T„/Ti, 

which induce 

Si = fi'n-l('S'j+l) = Si^i/Si, Ti = Qn-liTi^i) = Tj+i/Ti, 

for the purported splitting 3? = § © T. To confirm that this is a genuine decomposition, we 
note from linear algebra that the twin facts 

Ri+i = S^+i © Ti+i, Ri = Si®Ti = n © (T,+i n Ri) 

imply that 

R,+i/Ri = {S,+i/Si) © (T,+i/Ti) 
as required. □ 



Remark. There is a high-level proof of Lemma 3.18 which in some sense is the natural 
explanation for the result. We outline this proof now. The first observation is that the 
transformation V — R(V) is a functor from rMod to Filt„: a morphism a : V — W induces 
a morphism R(a) : R(V) R(W). Indeed, R{a) is defined to be a„ : — Wn, one must 
check that this respects the filtrations on Vn and Wn- Being a functor, R defines a ring 
homomorphism End(V) — >■ End(R(V)). The second key fact is that this homomorphism is 
an isomorphism if V is streamlined (in general it is surjective). It is well known that direct- 
sum decompositions of a module can be extracted from the structure of its endomorphism 
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ring: direct summands correspond to idempotent elements of the ring. It follows that V and 
R(V) have the same decomposition structure. 



4. The Interval Decomposition Algorithm 



Here we describe the algorithm for determining the indecomposable factors of a r-module. 
We give three versions of the 'algorithm'. 



The first version, in Section 4.1, is not an algorithm but a proof that every r-module 



decomposes as a sum of interval modules (Theorem 2.5 ). Moreover, the structure of the proof 
makes it clear how to compute the interval decomposition (Theorem 4.1). The algorithms 
in the subsequent sections build on this. 



In Section |4.2| we describe an abstract form of the decomposition algorithm, using the 
language of vector spaces and linear maps. No consideration is given to how the spaces and 
maps are described and manipulated in practice. 



In Section [4^ we suppose that the maps fi,gi are presented concretely as matrices Mi,Ni 
with respect to a choice of bases for the vector spaces V^. We describe an algorithm which 
takes these matrices as input and returns the interval decomposition. 

4.1. The interval decomposition theorem. Our present goal is to give a somewhat con- 



structive proof of Theorem 2.5, which asserts that any r-module V is isomorphic to a direct 
sum of intervals I{b, d). We prove a stronger, more precise result, which explicitly determines 
the multiplicity of each interval within V. 

Some notation will help with the theorem statement. If 



then let 



V=(\/i^...^^K) 



denote the truncation of V to length /c, and let r[fc] denote its type (which is a truncation 
of r). 

Theorem 4.1 (Interval decomposition). Let Y be a r-module. For 1 < k < n, define 

ib1,bl...,bl) = Hr[k]). 

Writing = R(V[A;]), define 

i^k k k-. _ f dim(3^fc n Ker(/fc)) 

^ 1' \ dim(Difc) - dim(D^fc n Im(^fe)) 

(whichever is applicable) when k ^ n, and 

(c^c^,...,0=dim(3^„). 

Then 

V- c'^Iiblk). 

l<i<k<n 



Addendum 4.2. In the situation of Theorem 4-1 • write 



l5 • • • 5 ' fcy 



dim(D^fc) 



17 



for k 



, n, and conventionally define r""*"^ = for all i. Then 

h 



„k _ k+1 

i i 



rt — r: 



k+l 
i+l 



case 



case 



9fe 



for 1 < i < k < n. 



The decomposition strategy begins with the following lemma. The idea is to proceed 
from left to right along the complex, removing streamlined summands at each step. Having 
done this, the Remak decompositions of those summands can be determined by counting 



dimensions, as prescribed in Theorem |3.19 

Lemma 4.3. Let V = Vi . . . Vn be an irreducible r-module of length n. Then there 
exists a direct-sum decomposition 

V = © © ■ ■ ■ © V" 

where each is supported over the indices {1,2, 
range. 

The following picture illustrates the decomposition. 



, k} and is right- streamlined over that 



V 





= Vi 








© 


= Vi 


Pi 

i !> 


Vi 




© 

© 


= Vi 


Pi 
< > 


Vi 


^ Vi 



Pi 

i > 



VI' 



P2 

i >• 



VJ- 



P3 
i > 



Pn-1 
< > 



'2 ^ ' "3 

Each row (i.e. summand) is right-streamlined, and therefore amenable to analysis via the 
right-filtration functor. 

Proof. We proceed by induction on the length of V. The inductive statement is that 

Y[k] = © ■ ■ ■ © V^'~^ © W'^ 

where the V* are as in the theorem statement, and W'^ is itself right-streamlined. 

For the base case k = 1, there is nothing to prove: take = V[l]. Now suppose the 
inductive statement is established for k, and consider Y[k -\- 1]. This can be written 

V[k + 1] = (V^ © ■ ■ ■ © V^-i © W'^) ^ Vk+i 

= © ■ ■ ■ © © (W^ ^ Vk+i) 

where the rebracketing is permissible because all of the V terms terminate before time k, 
and therefore do not interact with The goal now is to rewrite (W'^ ^+i) as 

V'^ © W'^^"'^, where V'^ terminates at time k and both V'^ and W'^^"'^ are right-streamlined. 
The rightmost term of W'^' is Vk, so R(W'^) is a filtration on Vk- 

jk fk _ 



Case /: 



J'' is 14, so R(W'^) 

Vk+i- In other words fk 

)le 

18 



Vk 



Let § = R(W^) n Ker(/fc). 

Proposition 3.11 implies that S has a complement in R(W'^); say R(W'^) = S © T. This 



Vk+i- 

ik\ 



corresponds (Lemma 3.18) to a direct sum decomposition W*^ = V*^ © 
summands are streamlined (Proposition 

Vfc+i). To check that this works, note that fk is zero on (V'^ 



where both 

3.17\. This defines V^ and we set W'^+i = (W'^ ^ 

Ker(/fc) and is injective 



on the complementary subspace (W Thus V is a summand of Y[k + 1] terminating at 
time k, and 
Case g: 



iyfc+i jg streamlined. 



Vk+i- In other words gt : Vk+i 



Vk. Let § = R(W'=) n Im(^fc). 
Proposition 3.11 impl ies th at S has a complement in R(W'^); say R(W^) = S © T. This 
corresponds (Lemma 3.18| to a direct sum decomposition W'^ = © V'^, where both 
summands are streamlined (Proposition 3.17). This defines V^, and we set W'^^^ = (W^ 

Im((7fc) and misses 



Vfc+i). To check that this works, note that gk is surjective onto {W^)k 



the complementary subspace {Y'^)k- Thus is a summand of V[A; + 1] terminating at time k, 
and W^+^ is streamlined. 

This establishes the inductive step, so eventually 

V = Y[n] = © • • • © V"-^ © W" 

and we set V" = W" to finish the proof. □ 

Proof of Theorem 4jJ_' Write V = V"*^ © ■ ■ ■ © V" according to Lemma 4^, We now calculate 
the decomposition of each V'^ into intervals 1(6, k). Note that 

Y[k] = V'^ © Y''+'[k] © ■ ■ ■ © V"[A;]. 

We can write W'^ = V'=+^ © ■ ■ ■ © V", so then 

Jl^ = R(V'= © W^[fc]) = R(V^) © R(W'=[A;]) 



(using Proposition 3.14). This is a filtration on V"^ 

Suppose k < n. We note that W*^ is streamlined up to time k + 1, whereas V'^ is zero at 
time k + 1. The next map in the sequence is 



Vk © ^ 
In the first case, it follows that V^^ 



W' or V,' © W', 



- Ker(/fc) and therefore R(V'^) = n Ker(/fe). In the 
second case, is a complement to lm{gk) in Vk, so ^k = R(V^) © {'Jik H lm{gk)). Thus 



dim(R(V'^)) 



dim(3^, n Ker(/fe)) 

dim(3^fc) - dim{Jik n Im(^fc)) 



-l5 



^4) 



(whichever middle term is applicable). When k = n, moreover, we have 



dim(R(V")) = dim(3^„) = {d 



1 5 



Thus, at last. 



l<k<n 



l<k<n Kl<i<k 



using Theorem 3.19 to decompose the V . 

Proof of Addendum^^ Write {w\, . . . , w^) = dim(R(W^[A;])). Since "Jlk = R(V'=)©R(W^ 
we can take dimensions and obtain the formula 



□ 



l5 



'15 



.Cfc) + (Wl, 



19 



Note also that ^Rk+i = R(V[A; + 1]) = R(W''[fc + 1]). Moreover, W'' is streamlined up to 
time k + 1. It follows that 



k+l 



A;+l^ 



dim(R(W'^[A; + 1])) 



and therefore 



i 

i+1 



case / 
case g 



which is the desired formula. 



case / 
case g 



□ 



4.2. Abstract vector spaces. We now transcribe Theorem 4.1 as an abstract algorithm 
for determining the interval structure of a r-module V of length n. This algorithm will serve 
as a skeleton for the more concrete algorithms developed later. 

Algorithm 4.4. We proceed through k = 1,2, . . . ,n, computing the filtration CR^ = R(V[fc]), 
the birth-time index b(r[A;]), and the dimensions c'l iteratively. 

BEGIN 

Initialisation (A; = 1): 

(1) 3^i = (0,Vi). 

(2) b(r[l]) = (1). 

Iterative step (k = 1, 2, . . . , n — 1): 

(3) Calculate O^fc+i from Difc = (R^, R\, 



(4) Calculate b(r[A; + 1])) from b(r 



R^) using Definition 



3.1 



{fk{Ro)^ fk{Rf), • • • , fk{Rk), Vk+l) 

{(),g-^\Rl),g-,\R\),...,g-,\Rl)) 



case / 
case g 



{h\,h 



2i 



rk+U 

■ ' "k+lJ 



btk + 1] 



ik + i,b'i,...,bi 



using Definition 

case / 
case g 



3.6 



(5) Calculate {c^, . . . , c^) using the formula in Theorem 4.1 



(Ci, C2, 



5 '-k) 



dim(3^fc n Keiifk)) 

dim(3^fc) - dimCRk n lm{gk)) 



case / 
case g 



Alternatively, use the formula in Addendum |4.2 



„k 



k+l 



i+1 



case / 
case g 



Here (rj^, . . . , r^) = dim(3lfc). 
Terminal step (k = n): 

(6) Calculate (c^, ...,<)= dim(a^(V)). 
Print results: 

(7) For 1 < i < k < n, the interval , k) occurs with multiplicity c'^. 

END 
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Note that steps (3-5) have an '/' verson and a '(yf' version, depending on the direction of 
the map pk- 

This abstract algorithm does not specify how the filtered vector spaces R(V[/c + 1]) are 
stored, nor how the maps fk or (which are used in steps (3) and (5)) are represented. In 
any concrete setting, it is necessary to specify data structures. A good choice will facilitate 
the calculations in steps (3) and (5). In the next section, we work out the details in a simple 
scenario. 

4.3. Concrete vector spaces. In this section we describe an algorithm to solve the follow- 
ing concrete problem. Let r be a type of length n. We specify a r-module V as follows. Set 
Vi = F"* for integers a, > 0. For each i, the map is defined by an Oj+i-by-aj matrix Mj or 
else the map gi is defined by an Oi-by-aj+i matrix iVj. We are to determine Pers(V), given r 
and the matrices Mi or Ni. 

We follow Algorithm |4.4[ The substantial task is to calculate the sequence of right- 
filtrations = R(V[A;]), for step (3). Everything else is book-keeping: the birth-time 
indices 6^ are calculated according to step (4); and the filtration dimensions rf (and hence 
the c^) will be easy to read off from the stored description of the filtrations. 

Basis transformations. The algorithm operates on two levels. On the conceptual level, we 
proceed by modifying the bases of the spaces Vi by elementary basis transformations. Initially 
each basis Bj is the standard basis of F"\ We perform modifications on S2, CB3, . . . , !B„_i in 
sequence. On the pragmatic level, what we actually do is apply elementary row and column 
operations to the matrices Mj or Ni. We make no attempt to track the bases themselves; 
instead we implement the effect of those changes on the matrices. 

Suppose we apply elementary basis transformations to "Bk+i on the conceptual level. On 
the pragmatic level, we must perform 

row operations on or column operations on 

and simultaneously perform 

column operations on M^+i or row operations on Nk+i 

to enact those transformations. Thus, at every stage we must make parallel changes to two 
matrices simultaneously. Usually we are working to put Mk or A^^ in a particular form, and 
while doing so the changes have to be mirrored in M^+i or Nk+i (paying no attention yet to 
the structure of that matrix). 

We now make this precise. The elementary transformation Ei{p,q,X) is defined as 
follows. On the conceptual level, this is a modification of Bj = [jSi, . . . ,/5aJ involving basis 
vectors Pp and Pg-. 

Pp ^ /3p 

Pg ^ Pq + XPp 

On the pragmatic level, if L is a matrix representing a linear map Vi ^ W for some W (this 
will be A^i_i or Mj in our situation), then we modify the columns of L accordingly: 

ColumUp ^ ColumUp 

ColumUg <— ColumUg +A ColumUp 
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Else, if L represents a linear map of the form W ^ Vi (this will be M(_i or Ni in our 
situation) then we must apply the dual transformation to the rows of L: 



RoWp <— RoWp— ARoWg 
Rowq ^ RoWg 



In spirit, we right-multiply by the matrix 



to modify columns, or else left-multiply 



by the inverse matrix 



to modify rows. 



1 A 

1 -A " 
1 

Besides the elementary transformations g. A), it is sometimes appropriate to permute 
the basis elements. The operation Pi{p,q) of interchanging f3p with (3q is realised pragmati- 
cally by interchanging Column^ with Column^, or RoWp with RoWg, as appropriate. 

Filtrations. The filtration "Jl^ = R-(V[/c]) on is to be represented as follows. We require 
the basis = . . . , /3a.) to be compatible with the filtration, in a sense that will become 
clear. Assuming such a basis, the filtration "^ik = {Ro,Ri, . . . ,Rk) is represented as a non- 
decreasing function 

(f)k : {1,2, ...,ai} ^ {1,...,A;} 

so that 

Ri = Span {Pp I (j)k{p) < i} 

for i = 1, . . . , k. In other words: the first few basis elements (those (3p with (pkip) = 1) form 
a basis for Ri; the next few basis elements extend this to a basis for R2, and so on. The 
dimension rf = dim(_Rj/i?j_i) can be read off as the cardinality of 0^^(i). 

Gaussian elimination. Step (3) boils down to the following task. Suppose that and (pk 
together represent the filtration 3^^; then modify "Bk+i and determine (pk+i to represent ^k+i- 
We now explain how to do this. 

Case M: the matrix Mk represents a linear map Vk —>■ Vk+i- We assume that "Bk is 
compatible with the filtration "Jik, and that (pk identifies the filtration. This gives a block 
structure 

Mk=[Ki K2 ■■■ Kk] 

where Ki gathers together the columns q with (pkiq) = i- Using row operations only, put Mk 
into (unreduced) row echelon form. This means: 

• Each of the top r rows contains a 1 (the pivot) as its leftmost nonzero entry. 

• Each pivot lies strictly to the left of the pivots of the rows below it. 

• The lowest ak+i — r rows are entirely zero. 

These row operations correspond to elementary operations Ek+i{p,q, X), and the effect of 
these operations is felt on the next matrix Mk+i or Nk+i, which must be modified accordingly. 
We now define (pk+i as follows: 

4>k{q) if row p has a pivot in column q, 

k + 1 if row p has no pivot. 



0fc+i(p) = 

See Figure [3] It is evident in the figure that maps onto Ri^^ for all i. 
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Figure 3. Using row echelon form to compute ^k+i from 



4 3 2 1 



Figure 4. Using column echelon form to compute "Jik+i from Jl^.. 



Case N: the matrix represents a linear map Vk+i Vk- We assume that is 
compatible with the filtration Jlk, and that <pi identifies the filtration. This time we have a 
vertical block structure 



Lk 



where Li gathers together the rows q with (pkiq) = i- Using column operations only, put 
Nk into the column echelon form defined as follows (this echelon form begins on the bottom 
left): 

• Each of the leftmost r columns contains a 1 (the pivot) as its lowest nonzero entry. 

• Each pivot lies strictly lower than the pivots of the columns to the right of it. 

• The rightmost at+i — r rows are entirely zero. 

These column operations correspond to elementary operations Ek+i{p, q, A), and the effect of 
these operations is felt on the next matrix Mk+i or A^^+i, which must be modified accordingly. 
We now define (pk+i as follows: 



Jk+l 



(pkiq) + 1 if column p has a pivot in row q, 

1 if column p has no pivot. 



See Figure [ij It is evident in the figure that -R^+i is the largest subspace which maps into 
Ri, for all i. 

This concludes our treatment of the concrete form of the zigzag algorithm. 

23 



5. Further Algebraic Techniques 

5.1. Localization at a single index. Let V be a zigzag module of length n and let 1 < A; < 

n. We consider the problem of determining the set of intervals in Pers(V) which contain /c, 
without necessarily computing Pers(V) itself. We shall see that all the necessary information 
is contained in a pair of filtrations on the vector space Vk- 

Definition 5.1. Let V be a zigzag module of length n. The left-filtration of V is a filtration 
on Vi of depth n, defined as 

L(V) = R(V) 

where V is the reversal of V; so = with maps fi = Qn-i or gi = fn-i- 

For any k we therefore have two natural filtrations on V^: 

Oik = {Ro,Ri,...,Rk) = R(V[1,A;]), 
£fc = {Lo,Li, . . . ,Ln+i-k) = L(Y[k,n]); 

the right-filtration over the index set {!,..., fc} and the left-filtration over the index set 
{k, . . . , n}. We also have birth-time and death-time indices 

bfc = = b(r[l,A;]) 

dfc = {di,.. . ,dn+i-k) = n + l- h{f[k,n]) 

which indicate the birth and death times associated with the respective subquotients of Jik 
and Lfc. These depend on the type r of V. 

Example 5.2. Consider the zigzag module 
At k = 2, for instance, we have 

% = (0,/l(Fl),y2) 

^2 = {0,f2\0)J^'g,{V,),V2) 

and 

b2 = (1,2) 
d2 = (2,4,3). 

We can now state the main theorem of this section. 

Theorem 5.3 (Localization at index k). Let Y be a zigzag module of length n and let 
1 < k < n. Let ^k,^k denote the right- and left-filtrations at k, and let bfc,dfc denote 
the birth-time and death-time indices at k. Then, for all i,j in the range 1 < i < k, 
l<j<'"- + l — ^; the multiplicity of [hi, dj] in Pers(V) is equal to 

Cij = dim(_Rj n Lj) — dim(_Rj_i fl Lj) — dim(i?j fl Lj_i) + dim(_Rj_i fl Lj_i). 

Remark. Equivalently, Cjj = dim((i?j fl Lj)/((i?j_i fl Lj) + (i?j fl Lj_i))), the dimension of 
the (i, j)-th bifiltration subquotient. 

This theorem answers the original question, because every interval containing k can be 



written as [hi, dj] for some choice of i,j. We now work towards a proof of Theorem 5.3 



Proposition 5.4. R is sufficient to prove Theorem \5.3\ in the special case where V is right- 
streamlined over {1, . . . ,k} and left- streamlined over {k, . . . ,n}. 

24 



Proof. It is clear from Lemma 4^ that we can write V = U © W where U is supported in 
{1, . . . , k — 1} and W is right-streamhned over {1, . . . , k}. Indeed, take U = V"*^ © ■ • ■ © V'^""^ 
and W = V'' © ■ ■ ■ © V". Moreover, it is sufficient to prove Theorem |5.3| for W, because 



the ffitrations ^^^^k remain unchanged from V, and the discarded term U decomposes into 
intervals which do not contain k. Thus, we may assume that V is right-streamlined over 

Repeating this argument from the other side, we may further assume that V is left- 
streamlined over {k, . . . ,n}. □ 



Proof of Theorem \5.3[ Assume that V satisfies the condition in Proposition 5^ It follows 



that every interval in Pers(V) contains k: any other interval in the decomposition would cause 
a failure of the streamline condition. We can therefore write the interval decomposition of V 

as 

where A indexes the summands, and p : A {1, . . . ,k} and q : A {1, . . . ,n — k + 1} 
identify the interval type of each summand in terms of the birth-time and death-time indices. 
It is apparent from this formulation that 

Cij = #{a e A I p{a) = i, q{a) = j} 

and it remains to compute this in terms of the dimensions dim(i?j fl Lj). 

The interval decomposition restricts at index to a direct sum decomposition of Vk into 
1-dimensional subspaces Ua, generated by elements Xa, say. Then 

Olfc = R(IJ1, A;]) = 31, n f/, ^ 3{p{a), k) 

a<^A aeA a<=A 



where the final isomorphism comes from Lemma 3.9 Now, the filtration subspace Ri is 
spanned by the terms isomorphic to S{p, k) with p < i. In other words, for i = 1, . . . ,k we 
have 

Ri = Span {xa \ p{a) < i} ■ 
A similar argument proceeding from the other direction gives the analogous formula 

Lj = Span{xa | q{a) < j} , 

ioY j = 1, . . . ,n + 1 — k. Since the Xa are independent, these formulas give bases for Ri, Lj. 
We now claim that 

Ri n Lj = Span{xa | p{a) < i, q{a) < j} 

for all The inclusion Span C Ri r\Lj is obvious, because each of the spanning vectors Xa 
belongs to both Ri and Lj. In the other direction, if x e -Rj fl Lj then write x = J2aeA '^a^a- 
Since x G Ri, all the coefficients with p{a) > i must be zero. Since x G Lj, all the 
coefficients with g(a) > j must be zero. Thus x G Spanjxa | p{a) < i, q{a) < j}. This 
establishes the reverse inclusion Ri fl Lj C Span and hence the equality. 
Then 

i j 

dim{R, n Lj) = | p{a) < i, q{a) <j} = ^^ Cpg 

p=l q=l 

for all The formula in the theorem follows easily from this. □ 
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Remark. The salient fact behind this result is that it is possible to find a direct sum de- 
composition of Vfc which simultaneously decomposes the filtered spaces D^^, into intervals 
within their respective categories Filt^, Filt„+i_/;. Here we achieved this by appealing to 
the interval decomposition of V, but this can also be proved directly for an arbitrary pair 
of filtrations on a single vector space. The analogous statement for a triple of filtrations is 
false. For example 

(0,F©0,F2), (0,0©F,F2), (0,A,F2), 
(where A = {{x,x) \ x e F}) cannot be simultaneously decomposed into intervals. 
5.2. The Diamond Principle. Consider the following diagram: 

V\ < > ■ ■ ■ < > Vk-l Vfc+1 < > ■ ■ ■ < > Vn 



Uk 

Let V"*" and V~ denote the two zigzag modules contained in the diagram: 

We wish to compare Pers(V"'") with Pers(V~), particularly with respect to intervals that 
meet {k — 1, k, k + 1}. This requires a favourable condition on the four maps in the middle 
diamond. 

Definition 5.5. We say that the diagram 

9k 



fk 



fk- 



is exact if Im(Di) = Ker(D2) in the following sequence 

Uk Vk-i © Vk+i Wk 

where Di{u) = gk-i{u) © fk{u) and D2{v © v') = fk-i{v) - gk{v')- 

Theorem 5.6 (The Diamond Principle). GivenY^ andY~ as above, suppose that the middle 
diamond is exact. Then there is a partial bijection of the multisets Pers(V"'") and Pers(V~), 
with intervals matched according to the following rules: 

• Intervals of type [k, k] are unmatched. 

• Type [b, k] is matched with type [b, k — 1] and vice versa, for b < k — 1. 

• Type [k, d] is matched with type [k + 1, d] and vice versa, for d> k + 1. 

• Type [b,d] is matched with type [b,d], in all other cases. 
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Figure 5. Interval matching between Pers(V^) and Pers(V ): (top row) the 
five cases where matching occurs; (bottom row) unmatched intervals [k, k]. 
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Figure 6. From Pers(V+) to Pers(V~), for n = 6, = 4: points in the 
persistence plane move according to the arrows; the multiplicity of the point 
marked Kl changes unpredictably. 



It follows that the restrictions Pers(V+)|/^, Pers(V )\k to the set K = {1, . . . ,n}\ {k} are 
equal. 



Figures [5] and [6] illustrate Theorem 5^ in terms of barcodes and persistence diagrams, 
respectively. 

Remark. The I{k,k) summands in Pers(V^) span the cokernel of D2, whereas the I{k,k) 
summands in Pers(V~) span the kernel of Di. The hypothesis of Theorem 5.6 does not bring 
about any relation between these spaces (which is why the [k,k] intervals are unmatched). 
In Section 5.3, however, we consider a situation in which the [k, k] intervals can be tracked. 



We use the localization technique of Section 5J^ to prove Theorem 5^ We begin with 
birth- and death-time indices. 

Proposition 5.7. LetT^,T^ denote the zigzag types of Y~^,Y~ respectively. If we write 

(61, . . . , 6fc„i) = b(r+[l. A; - 1]) = b(r- [1, k - 1]) 
for the birth-time index up to time k — 1, then 

b(r+[l,A; + l]) = (A; + l,6i,...,6fc_i,A;). 

b(r-[l,A; + l]) = {k,bi,...,bk-i,k + l), 
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Similarly, if we write 

= d{T+[k + l,n]) = d{T-[k + l,n]) 
for the death-time index from time k + 1, then 

d(r+[A; - = {k - l,di, . . . ,dn-k,k). 

d{T~[k-l,n]) = {k,di,. . . ,dn-k,k - 1), 

Proof. This is immediate from the recursive definition of birth-time index. If we write tq = 
r+[l, k — 1] = T^[l,k — 1] then r+[l, k + 1] = r^fg and t~[1, A; + 1] = T^gf . The death-time 
index is treated similarly. □ 

Here is the crux of the matter: 



Lemma 5.8. In the situation of Theorem \5.6 , the following filtrations are equal: 

R(V+[1,A; + 1]) = R(V-[1,A; + 1]) 
L(V+[A;- l,n]) = L(V" [A; - 1, n]) 

Proof. Write {Rq, Ri, . . . , Rk-i) = R(V+[1,A; - 1]) = R(V-[1,A; - 1]). By the recursive 
formula (Definition 3.1), 

R(V+[1, k + l]) = (0, g,'fk-iiRo), . . .,g^'fk-i{Rk-i), V,+,) 

and 

R(V-[1, k + l]) = (0, fkg^\{Ro), fk9l\{Rk~i), Vk+i). 
Thus we can prove the first statement of the lemma by showing that 

fk9',\{R) = 9,' fk-i{R) 

for any subspace R < Vk-i- We use first-order logic. Let x G Vk+i- We have the following 
chain of equivalent statements. 

X e fk9k\iR) 

^ {3z G R) {By G Uk) {{9k-i{y) = z) k {fk{y) = x)) 
^ i3z e R) i3y e Uk) (Diiy) = z ® x) 
^ {3z e R){z®x e Im(Di)) 

On the other hand: 

X e g^^fk-iiR) 
^ i3z G R) ifk-iiz) = gk{x)) 
^ {3z e R){z®x e Ker{D2)) 

Since Im(Di) = Ker(D2) by hypothesis, it follows that fk9k\{R) = 9k^ fk-i{R)- 

This proves the first equality. The second equality follows symmetrically. □ 



Proof of Theorem 5.6. We adopt the notation of Section 5.1, and consider the right- and 
left-filtrations at V^+i, for both V+ and V~. Since Y^[k + 1, = V~[fc -|- 1, n] we have 



and by Lemma 5^ we have 

1?+ 
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"^k+l ~ "^k+l- 



Finally, b^^^ agrees with b^_,_^ except that k,k + 1 are interchanged, according to Proposi- 
tion 5.7 Thus, when we use Theorem 5.3 to calculate the multiplicity of [b, d] for b < k + 1 < 
d, there is perfect agreement between V"*" and V~ except that we must interchange k,k + 1 
when they occur as birth-times. 

A symmetrical argument can be made, localizing at \4_i. When we compute the multi- 
plicity of [b, d] for b < k — 1 < d, there is perfect agreement between V+ and V~ except that 
we must interchange k,k — 1 when they occur as death-times. 

We have covered all cases of the theorem except for intervals which meet neither k — 1 
nor k + 1. Intervals contained in [1, /c — 2] are automatically the same for V"^ and V~ 
because they can be computed by restricting to V+[1,A; — 1] and V~[1,A; — 1], which are 
equal. Similarly, intervals contained in [k + 2, n] are the same for and V~, by restricting 
to Y+[k + l,n] =Y-[k + l,n]. 

Finally, consider intervals [k, k]. Nothing can be said about those. □ 

5.3. The Strong Diamond Principle. The Diamond Principle can usefully be applied 
to the following diagram of topological spaces and continuous maps. The four maps in the 
central diamond are inclusion maps, and the remaining maps ^ are arbitrary. 

AUB 

Xi i — > ■■■ ^ Xk-2 < — > A B < — > Xk+2 ^ ■■■ < — > Xn 



AnB 

Let X+,X^ denote the upper and lower zigzag diagrams contained in this picture; so X"*" 
passes through AU B and X~, passes through Ad B. 

Theorem 5.9 (The Strong Diamond Principle). Given X"*" and X~ as above, there is a 
(complete) bisection between the multisets Pers(if*(X~^)) and Pers(if^,(X~)). Intervals are 
matched according to the following rules: 

• [k,k\ G Pers(if^+i(X+)) is matched with [k^k] G 'Pem{Hi{%r)). 
In the remaining cases, the matching preserves homological dimension: 

• Type [6, k] is matched with type [6, A; — 1] and vice versa, for b < k — 1. 

• Type [k, d] is matched with type [k + 1, d] and vice versa, for d > k + 1. 

• Type [b,d] is matched with type [b,d], in all other cases. 

Proof. For any i, apply the homology functor Hg to the diagram. The central diamond 

He{A) HeiAUB) 



He{A HB) ^ H,{B) 
is exact by virtue of the Mayer-Vietoris theorem, according to which 

... — > Hi{A nB) ^ He{A) © Hi{B) He{A U B) — > ... 

is an exact sequence. The Diamond Principle therefore applies to if^(X^) and Hg{li~), and 
we have a partial bijection which accounts for all intervals except those of type [k, k]. 
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Now consider the connecting homomorphism in the same Mayer- Vietoris sequence: 

By exactness, d induces an isomorphism between the cokernel of D2 and the kernel of Di. 
But the [k,k] summands of Pers(if^+i(X"'")) precisely span Coker(D2); whereas the [k,k] 
summands of Pers(if^(X~)) span Ker(Di). This establishes the claimed bijection between 
the [k, k] intervals. □ 

Example 5.10. Let X = {Xi, . . . ,X„) be a sequence of simplicial complexes defined on a 
common vertex set. Suppose these have arisen in some context where each transition Xi to 
Xj+i is regarded as being a 'small' change. There are two natural zigzag sequences linking 
the Xi. 

The union zigzag, Xj: 

Xi U X2 • • • ■ ■ ■ Xn~l U Xn 



Xi X2 Xn-1 Xn 

The intersection zigzag, Xp: 

Xi X2 Xn-l Xn 



Xi n X2 ■ ■ ■ ■ ■ ■ Xn^i n Xn 

We can think of these as being indexed by the half-integers {1, 1^, 2, 2^, . . . ,n}. 

We can apply the Strong Diamond Principle n — 1 times to derive the following relation- 
ship between the zigzag persistence of the two sequences Pers(if^(Xn)) and PeTs{He(Ku)) . 
Restricting to the integer indices, there is a coarse equality: 

Pers(i/,(Xu)) I {!,...,«} = Pers(ff,(Xn))|{i,...,n} 

More finely, there is a partial bijection between Pers(if£(Xu)) and Pers(if£(Xn)). Intervals 
shift homological dimension by +1 (from the intersection sequence to the union 
sequence). Otherwise [b, d\ [b', d'] where {b, b'} is an unordered pair of the form {k^, k + 1} 
and {d,d'} is an unordered pair of the form {k,k^}; dimension is preserved. Figure [t] 
illustrates the complete correspondence as a transformation of the persistence diagram, for 
n = 5. 

Concluding Remarks 

We have presented the foundations of a theory of zigzag persistence which, we believe, 
considerably extends and enriches the well known and highly successful theory of persistent 
homology. Zigzag persistence originates in the work of Gabriel and others in the theory 
of quiver representations. One of our goals has been to bridge the gap between the quiver 
literature (which is read largely by algebraists) and the current language of applied and 
computational topology. To this end, we have presented an algorithmic form of Gabriel's 
structure theorem for An quivers, and have indicated the first steps towards integrating these 
ideas into tools for applied topology. 

There are several ways in which this work is incomplete. The most significant omission 
is an algorithm for computing zigzag persistence in a homological setting (as distinct from 
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Figure 7. From Pers(i^*(Xn)) to Pers(i/^,(Xu)), for n = 5: points in the 
persistence plane move according to the arrows; points marked © stay fixed 
and increase homo logical dimension by 1. 



the somewhat sanitised vector space algorithm described in Section 4.3). We address this 
gap in a forthcoming paper with Dmitriy Morozov [2], where we present an algorithm for 
computing the zigzag persistence intervals of a 1-parameter family of simplicial complexes 
on a fixed vertex set. 

We have made no effort in this paper to fiesh out the applications suggested in Section [T| 
There is often a substantial gap between the concrete world of point-cloud data sets and 
the ideal world of simplicial complexes and topological spaces. We intend to develop some 
of these applications in future work. Meanwhile, we have given priority to establishing the 
theoretical language and tools. The Diamond Principle is particularly powerful. In the 
manuscript with Morozov [2j, we show that the Diamond Principle can be used to establish 
isomorphisms between several different classes of persistence invariants of a space with a 
real-valued (e.g. Morse) function. In particular, we use zigzag persistence to resolve an open 
conjecture concerning extended persistence [5]. This supports our prejudice that zigzag 
persistence provides the appropriate level of generality and power for understanding the 
heuristic concept of persistence in its many manifestations. 
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